NSF PAR Search | NSF Public Access Repository


Parallel kd-tree with Batch Updates

https://doi.org/10.1145/3709712

Men, Ziyang; Shen, Zheqi; Gu, Yan; Sun, Yihan (February 2025, Proceedings of the ACM on Management of Data)

The kd-tree is one of the most widely used data structures for managing multi-dimensional data. Due to the ever-growing data volume, it is imperative to consider parallelism in kd-trees. However, we observed challenges in existing parallel kd-tree implementations, for both construction and updates. The goal of this paper is to develop efficient in-memory kd-trees by supporting high parallelism and cache-efficiency. We propose the Pkd-tree (Parallel kd-tree), a parallel kd-tree that is efficient both in theory and in practice. The Pkd-tree supports parallel tree construction, batch updates (insertion and deletion), and various queries including k-nearest neighbor search, range query, and range count. We proved that our algorithms have strong theoretical bounds in work (sequential time complexity), span (parallelism), and cache complexity. Our key techniques include 1) an efficient construction algorithm that optimizes work, span, and cache complexity simultaneously, and 2) reconstruction-based update algorithms that guarantee that the tree remains weight-balanced. With the new algorithmic insights and careful engineering effort, we achieved a highly optimized implementation of the Pkd-tree. We tested the Pkd-tree on various synthetic and real-world datasets, including both uniform and highly skewed data, and compared it with state-of-the-art parallel kd-tree implementations. In all tests, the Pkd-tree was consistently much faster than all baselines in construction and updates, with better or competitive query performance. We released our code.
Free, publicly-accessible full text available February 10, 2026
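
For readers unfamiliar with the data structure named in the abstract above, the following is a minimal sequential sketch of a kd-tree with median-split construction and a range-count query. It is not the Pkd-tree and does not reflect the authors' parallel or cache-efficient algorithms; the names `Node`, `build`, and `range_count` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple

Point = Tuple[float, ...]


@dataclass
class Node:
    """One kd-tree node (hypothetical layout, for illustration only)."""
    point: Point
    axis: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build(points: Sequence[Point], depth: int = 0) -> Optional[Node]:
    """Build a kd-tree sequentially by splitting at the median of one axis per level."""
    if not points:
        return None
    axis = depth % len(points[0])            # cycle through the dimensions
    pts = sorted(points, key=lambda p: p[axis])
    mid = len(pts) // 2                      # median point becomes the pivot
    return Node(pts[mid], axis,
                build(pts[:mid], depth + 1),
                build(pts[mid + 1:], depth + 1))


def range_count(node: Optional[Node], lo: Point, hi: Point) -> int:
    """Count stored points p with lo[i] <= p[i] <= hi[i] in every dimension i."""
    if node is None:
        return 0
    inside = all(l <= c <= h for l, c, h in zip(lo, node.point, hi))
    count = int(inside)
    pivot = node.point[node.axis]
    if lo[node.axis] <= pivot:               # left subtree holds coordinates <= pivot
        count += range_count(node.left, lo, hi)
    if pivot <= hi[node.axis]:               # right subtree holds coordinates >= pivot
        count += range_count(node.right, lo, hi)
    return count


points = [(2, 3), (5, 4), (9, 6), (4, 7), (8, 1), (7, 2)]
tree = build(points)
print(range_count(tree, (3, 1), (8, 5)))
```

Sorting at every level keeps the sketch short at the cost of extra work; it is meant only to show what construction and range count compute.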
BYO: A Unified Framework for Benchmarking Large-Scale Graph Containers

https://doi.org/10.14778/3665844.3665859

Wheatman, Brian; Dong, Xiaojun; Shen, Zheqi; Dhulipala, Laxman; Łącki, Jakub; Pandey, Prashant; Xu, Helen (May 2024, Proceedings of the VLDB Endowment)

A fundamental building block in any graph algorithm is a graph container -- a data structure used to represent the graph. Ideally, a graph container enables efficient access to the underlying graph, has low space usage, and supports updating the graph efficiently. In this paper, we conduct an extensive empirical evaluation of graph containers designed to support running algorithms on large graphs. To our knowledge, this is the first apples-to-apples comparison of graph containers rather than overall systems, which include confounding factors such as differences in algorithm implementations and infrastructure. We measure the running time of 10 highly-optimized algorithms across over 20 different containers and 10 graphs. Somewhat surprisingly, we find that the average algorithm running time does not differ much across containers, especially those that support dynamic updates. Specifically, a simple container based on an off-the-shelf B-tree is only 1.22× slower on average than a highly optimized static one. Moreover, we observe that simplifying a graph-container Application Programming Interface (API) to only a few simple functions incurs a mere 1.16× slowdown compared to a complete API. Finally, we also measure batch-insert throughput in dynamic-graph containers for a full picture of their performance. To perform the benchmarks, we introduce BYO, a unified framework that standardizes evaluations of graph-algorithm performance across different graph containers. BYO extends the Graph Based Benchmark Suite (Dhulipala et al. 18), a state-of-the-art graph algorithm benchmark, to easily plug into different dynamic graph containers and enable fair comparisons between them on a large suite of graph algorithms. While several graph algorithm benchmarks have been developed to date, to the best of our knowledge, BYO is the first system designed to benchmark graph containers.
Full Text Available
ParlayANN: Scalable and Deterministic Parallel Graph-Based Approximate Nearest Neighbor Search Algorithms

https://doi.org/10.1145/3627535.3638475

Manohar, Magdalen Dobson; Shen, Zheqi; Blelloch, Guy; Dhulipala, Laxman; Gu, Yan; Simhadri, Harsha Vardhan; Sun, Yihan (February 2024, ACM)

Full Text Available
Parallel Longest Increasing Subsequence and van Emde Boas Trees

https://doi.org/10.1145/3558481.3591069

Gu, Yan; Men, Ziyang; Shen, Zheqi; Sun, Yihan; Wan, Zijin (June 2023, ACM)

Full Text Available
Many Sequential Iterative Algorithms Can Be Parallel and (Nearly) Work-efficient

https://doi.org/10.1145/3490148.3538574

Shen, Zheqi; Wan, Zijin; Gu, Yan; Sun, Yihan (July 2022, ACM Symposium on Parallelism in Algorithms and Architectures)

Full Text Available